Constrained Parameter Estimation of Harmonic and Inharmonic Models for Separating Polyphonic Musical Audio Signals

نویسندگان

  • KATSUTOSHI ITOYAMA
  • MASATAKA GOTO
  • HIROSHI G. OKUNO
چکیده

This paper describes a sound source separation method for polyphonic sound mixtures of music including both harmonic and inharmonic sounds, and constrained parameter estimation using standard MIDI files as prior information. The difficulties in dealing with both types of sound together have not been addressed in most previous methods that have focused on either of the two types separately, because the properties of these sounds are quite different. We therefore developed an integrated weighted-mixture model consisting of both harmonic-structure and inharmonic tone models. On the basis of the MAP estimation using the EM algorithm, we estimated all model parameters of this integrated model under several original constraints for preventing over-training and maintaining intra-instrument consistency. We confirmed that the integrated model increased the SNR by 1.5 dB.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Source Separation of Musical Instrument Sounds in Polyphonic Musical Audio Signal and Its Application

A change of music appreciation style from “listening to high fidelity (Hi-Fi) sounds” to “listening to preferred sounds” has emerged due to evolution of digital audio processing technology for the past years. Previously, many people enjoyed passive music appreciation: e.g., they buy CD and phonograph recordings or download mp3 audio files, set the disks or files to various media players, and hi...

متن کامل

Polyphonic Pitch Tracking Using Joint Bayesian Estimation of Multiple Frame Parameters

We present a novel approach to pitch estimation and note detection in polyphonic audio signals. We pose the problem in a Bayesian probabilistic framework, which allows us to incorporate prior knowledge about the nature of musical data into the model. We exploit the high correlation between model parameters in adjacent frames of data by explicitly modelling the frequency variation over time usin...

متن کامل

PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals

This paper describes a real-time method, called PreFEst (Predominant-F0 Estimation method), for estimating the fundamental frequency (F0) of simultaneous sounds in monaural polyphonic audio signals. Without assuming the number of sound sources, PreFEst can estimate the relative dominance of every possible harmonic structure in the input mixture. It treats the mixture as if it contains all possi...

متن کامل

A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals

In this paper I introduce a method, called PreFEst, for estimating the fundamental frequency (F0) of simultaneous sounds in monaural polyphonic audio signals. Most previous F0-estimation methods have had difficulty dealing with such complex audio signals because these methods were designed to deal with mixtures of only a few sounds. Without assuming the number of sound sources, PreFEst can esti...

متن کامل

Harmonic Temporal Structured Clustering for Multiple Fundamental Frequency Estimation

This abstract describes a method for the Multiple Fundamental Frequency (F0) Estimation and Tracking task in the Music Information Retrieval Evaluation eXchange (MIREX) 2007. The method is called Harmonic Temporal Structured Clustering (HTC), which is a kind of constrained Gaussian Mixture Model (GMM) estimation using EM algorithm. It can jointly extract F0, intensity, onset, duration of each u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010